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Abstract 

Given a graph embedded in an orientable surface, a process con- 
sisting of random excitations and random node and face balancing is 
constructed and analyzed. It is shown that given a priori bounds g 
on the genus and n on the number of nodes, one can determine the 
genus of the surface from local observations of the process restricted 
to any connected subgraph which cannot be separated from the rest 
of the graph by fewer than lGg nodes. The observation time and the 
computation time are polynomial in n 9 . 

The process constructs slightly perturbed random "discrete ana- 
lytic functions" on the surface, and the key fact in the analysis is that 
such a function cannot vanish on a large piece of the surface. 



1 Introduction 

At least since Polya (1921) proved that "a drunk man will return home 
while a drunk bird might lose its way forever", it is known that geometric 
properties of the underling space can manifest themselves in the behavior of a 
random process taking place on the space. Moreover, in recent years random 
processes were used to retrieve information about the underlying space. E.g. 
sampling, volume estimates of convex bodies or scenery reconstruction along 
random walk paths (see for instance ||], Q and H). Rather then letting 
a random walker wander around and gather information, in this note we 
would like to study a problem in which a random process is observed locally 
in a fixed bounded neighborhood and still non-trivial global observations 
can be distilled from these observations. 
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In addition to the algorithmic motivation, a reasonable question is as 
follows. Consider a stationary spin system on a graph, such as Glauber 
dynamics for the Ising model on a graph. What properties of the graph can 
be inferred from properties of the process (for instance, are there interesting 
relations between the spectrum of the Ising dynamics and the graph spec- 
trum)? A harder challenge is to infer non-trivial global properties of the 
underling graph from local observations of the process. 

Facing the harder challenge one might devise first custom made variants 
of the standard spin systems, which can be analyzed and provide ways to 
compute global invariants using "physical" systems. 

Indeed, below we will study a reasonably natural and simple process, 
called noisy circulator, with local operations, living on the edges of a graph, 
embedded in an orientable surface. The noisy circulator consists of adding 
mass 1 to randomly chosen edges with a slow rate and balancing the flow into 
vertices or around faces at random and with a faster rate (details below). 
Somewhat surprisingly, it will be shown how to extract, with high probability 
and in polynomial time in the size of the graph, the genus of the surface by 
observing the restriction of the process to a bounded set of edges, depending 
only on an a priori bound on the genus. 

Although from a pure algorithmic view point this first construction might 
be useful and might have some advantages, the point is not to devise the 
optimal ad-hoc distributed algorithm for finding the genus, under certain 
restrictions, but to show how locally observing a simple physical process 
already does that, and to present this computation scheme. It is of interest 
then to construct other examples of similar flavor. 

The structure of the proof is twofold, a topological theorem and a sta- 
tistical element. 

In the topological part, we study discrete holomorphic 1- forms which we 
refer to here as smooth circulations: circulations that are also circulations 
on the dual map. These can be considered as discrete analogues of analytic 
functions. Every homology class of circulations contains exactly one smooth 
circulation, so the dimension of their space can be used to find the genus 
of the surface. A key result (Theorem 0), which is of independent interest, 
asserts that every connected piece of the vanishing set of a smooth circu- 
lations can be separated from the rest of the graph by a small number of 
nodes. 

In the statistical part, we determine the dimension of the space of smooth 
circulations from observations which can be considered as samples of smooth 
circulations restricted to a bounded set of edges with additional noise. 

The next subsection contains some necessary definitions and basic propo- 
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sitions; in subsection 1.2 a description of the process and a formulation of 
the main theorem are given, together with an outline of the proof. The rest 
of the paper contains the proof of the main theorem. In the final section we 
end with some further comments and problems. 



1.1 Circulations and homology 

Let 5 be a closed compact surface, and consider a map on S, i.e., a graph 
G = (V, E) embedded in S so that each face is a disc. We can describe the 
map as a triple G = (V, EjJ 7 ), where V is the set of nodes, E is the set of 
edges, and T is the set of faces of G. We fix a reference orientation of G; 
then each edge e E E has a tail t(e) € V, a head h(e) E V, a right shore 
r(e) E J 7 , and a left shore 1(e) E T. 

The embedding of G defines a dual map G* . Combinatorially, we can 
think of G* as the triple (J-, E, V), where the meaning of "node" and "face" , 
"head" and "right shore" , and "tail" and "left shore" is interchanged. (Tak- 
ing the dual of the dual will give the original map with every edge reversed; 
this should not concern us in this paper.) 

For each edge e, let Xe E M. E be the unit vector that is 1 on e and 
elsewhere; we define Xv E M. v for v E V and xf E for F E T analogously. 

For each node v, let 5v E M. E denote the coboundary of v. 

{1, if h(e) = v, 
-1, iit{e) = v, 
0, otherwise. 

Thus \5v\ 2 = d v is the degree of v. A vector <p E M s is a circulation if 

e: h(e)=v e: t(e)=v 

For every face F E J-, we denote by dF E M B the boundary of F: 

( 1, ifr(e)=F, 
(dF) e = <^ -1, if/(e) = F, 
[ 0, otherwise. 

Then dp = \dF\ 2 is the length of the cycle bounding F. 

Each vector dF is a circulation; circulations that are linear combinations 
of vectors dF are called null-homologous. Two circulations 4> and <f>' are 
called homologous if </> — 0' is null-homologous. 



3 



Let (ft be a circulation on G. We say that <ft is smooth if for every face 
F € .T 7 , we have 

• &F = 0. 

This is equivalent to saying that (ft is a circulation on the dual map G*. 

Remark 1 Smooth circulations are closely related to discrete analytic func- 
tions and are essentially the same as discrete holomorphic 1-forms. These 
functions were introduced for the case of the square grid a long time ago 
[|/. For the case of a general planar graph, the notion is implicit in ||/. 
For a detailed treatment see ^J^/. 

To explain the connection, let (ft be a smooth circulation on a graph G 
embedded in a surface. Consider a planar piece of the surface. Then on the 
set T' of faces contained in this planar piece, we have a function a : J-' — > R 
such that da = (ft, i.e., (ft(e) = o{r(e)) — cr(l(e)) for every edge e. Similarly, 
we have a function n : V' — > R ( where V is the set of nodes in this planar 
piece), such that 5tt = (ft, i.e., (ft{e) = ir(t(e)) — ir(h(e)) for every edge e. 
We can think of tt and a as the real and imaginary parts of a ( discrete ) 
analytic function. The relation 5ir = p(ft is then a discrete analogue of the 
Cauchy-Riemann equations. 

Thus we have the two linear orthogonal subspaces A C W E generated by 
the vectors 5v (v S V) and B C W E generated by the vectors dF (F e f). 
Vectors in B are 0- homologous circulations. The orthogonal complement 
A 1 " is the space of all circulations, and B^ is the space of circulations on the 
dual graph. The intersection C = A^^B^, the space of smooth circulations. 
So M. E = A © B © C. From this picture we conclude the following. 

Proposition 2 Every circulation is homologous to a unique smooth circu- 
lation. 

It also follows that C is isomorphic to the first homology group of S (over 
the reals), and hence we get the following: 

Proposition 3 The dimension of the space C of smooth circulations is 2g. 

1.2 Main result: randomized circulations 
1.2.1 The Noisy Circulator. 

We consider the following process on G. Let p > be fixed. We start with 
the vector x = E R . At each further step, the following two operations 
are carried out on the current vector x € M. E : 
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(a) [Node balancing.] We choose a random node v, and subtract from x 
the vector (x T 8 v /d v )5 v . 

(b) [Face balancing.] We choose a random face F, and subtract from x 
the vector (x T dp/dF)dF- 

In addition, with some given probability p > 0, we do the following: 

(c) [Excitation.] We choose a random edge e, and add 1 to x e . 

Immediately after a node balancing step, the node v just balanced sat- 
isfies the flow condition; a subsequent other node balancing, or a face bal- 
ancing may destroy this. Smooth circulations are invariant under node and 
face balancing, and we'll see that under repeated application of (a) and (b), 
any vector converges to a smooth circulation. 

1.2.2 Observing the process. 

Let U C V{G) induce a connected subgraph of G, and assume that every 
cycle that separates 5 into two parts S± and 52 so that Si contains U and 
52 is not a disc, has length at least 16g. We show that if p is small enough, 
then observing the process on the edges incident with U long enough, we 
can determine the genus of the surface. 

To be precise, we do the following. Let Eq be the set of edges incident 
with U. Let x(t) € M. E be the vector after t steps, and let y(t) be the 
restriction of x(t) € M. E ° to the edges in Eq. So we can observe the sequence 
random vectors y(0), y(l), .... 

Let x'(t) be the projection of x(t) onto C, and let y'(t) be the restriction of 
x'(t) to the edges in Eq. Because of the random steps (c), after a sufficiently 
long time, the vectors cc'(0), . . . , x'[t — 1) will span the space C. So the rank 
of this set of vectors gives us the dimension of C, and so by Proposition 
||, it gives us the genus of the surface. The restriction to Eq is one-to-one 
on C (Theorem 0), and so this rank is the same as the rank of the set 
W(0),y'(l),...,y>(t)}. 

Unfortunately, we cannot observe the vectors y'(t), only the vectors y(t). 
But (at least if p is small) we expect y(t) to be close to y'(t). Indeed, the 
"errors" x"(t) = x(t) — x'(t) are in A® B, and therefore it is not hard 
to show that they tend to exponentially fast (see lemma [l(] below), at 
least as long as no "excitation" step (c) occurs. Therefore the restrictions 
y"(t) = y(t) — y'(t) also tend to exponentially The speed of convergence 
depends on the eigenvalue gap of the transition matrix of the (undirected) 
random walk on G and G* . 
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So we are lead to the following standard statistical problem: there is a 
sequence y'(0),y'(l), ... of vectors in IR fc , which span a linear subspace L. 
We observe the sequence y(t) = y'(t)-\-y"(t), where the "error" y"(t) is small 
on the average. We want to find the dimension k. 

The random vectors x(t) or x'(t) are not independent; but if we take 
the differences, i.e., we look at the vectors x'(t + 1) — x'(t), then these are 
independent (in the probabilistic sense). Indeed, node balancing and face 
balancing don't change x'\ so if no excitation occurred in step t + 1, then 
x'(t + 1) — x'(t) = 0, and new flow was created on edge e, then it depends 
only on e. Hence the vectors y'{t + 1) — y'{t) are also mutually independent. 

Since y'{t + 1) — y'(t) = with probability 1 — p, it makes sense to 
aggregate N = 1/p of these terms to one. So we consider the vectors z(t) = 
y(Nt) - y(N(t - 1)) and z'(t) = y'(Nt) - y'(N(t - 1)). Then the vectors 
z'{t) are mutually independent samples from some distribution on L. (The 
errors z"{t) = z(t) — z'(t) may be dependent.) 

A further difficulty is that if an excitation step occurs close to the end 
of an aggregated interval [N(t — 1) + l,Nt], then the error z"{t) can be 
larger than the main term z'(t). This happens with small but not negligible 
probability. We handle this by randomly selecting just a fraction of these 
intervals, so that the probability of any of these bad large errors occurring 
is small. 

To formalize, we propose the following algorithm to recover the dimen- 
sion of C. 

Genus estimate. We assume that we are given upper bounds n > n+m+f 
and g > g. Let rriQ be the number of edges incident with U . Set 

T' = 6(n + g), e = n~ m ° w , T = AT' 2 . 

Construct a sequence of integers ti, t2, ■ . . , € [0, T — 1] as follows. If we have 
ti, t2, ■ ■ ■ , ifc, then compute the linear hull C(k) of z(ti), zfa), ■ ■ ■ , z{tk)- Let 
H(k) be the set of integers t £ [0, T — 1] for which the vector z(t) is farther 
from C(k) than e. If \H(k)\ < T', then return k/2 as your guess for g. Else, 
choose a number tk+i € H(k) randomly and uniformly. If k becomes larger 
that 2g, declare the procedure a failure and stop. 

The main result of this paper is the following. 

Theorem 4 The Genus Estimate Algorithm returns the correct genus with 
probability at least 2/3. 
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If you find that a success probability of 2/3 is not reassuring enough, 
independent repetition of the observation can boost this arbitrarily close to 
1. 

2 Properties of smooth circulations 
2.1 Harmonic functions 

It is an easy well-known fact that if G = (V, E) is a connected graph (the 
orientation is not needed right now), then for any two nodes a,b £ V there 
is a vector tt G such that for every node v, 

( 1, ifv = b, 
^ n u -d v TT v = < -1, if v = a, (1) 
u: uveE { 0, otherwise. 

This last expression is equivalent to saying that f e = Hh(e) ~ n t(e) 1S a flow. 
We denote this vector it by 7r a ^ if we want to express that it depends on a 
and b; if there is an edge e with h(e) = b and i(e) = a, then we also denote 
vr a ,b by 7r e . 

The vector ir is not unique; we can add the same scalar to each entry. 
For our purposes, it will be convenient to choose it so that 

5> n = 0. (2) 

u 

There are many interpretations of these functions; for example, let the 
graph represent an electrical network, with the edges having unit resistance. 
Send a unit electric current from b to a. Then the potential of u is ir u . The 
function ir a ^ is often called a harmonic function with poles a and b. 

In terms of the Laplacian L of the graph, the equations (|]) can be written 

as 

Lit = Xb~ Xa- 

The matrix L is not quite invertible, but it has a one-dimensional nullspace 
spanned by the vector 1 = (1,...,1) T , and so it determines 7r up to adding 
the same scalar to every entry. We assumed in (|2|) that l T 7r = 0. If J G 
]g>VxV denotes the all-1 matrix, then 

(L+ J)vr = Lit = Xb~ Xa, 

and so we can express it as 

7r = (L + J)- 1 (xb-Xa)- (3) 
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We can use harmonic functions to give a more explicit description of 
smooth circulations in a special case. For any edge e of G, let rj e be the 
projection of Xe onto C. 

Lemma 5 Let a,b € E be two edges of G. Then 

( ) = { (^W) ~ ( 7r &)t(o) + ^b)r{a) ~ (K)i(a) + 1 , ifa = b, 

{Jla)b \{Kb)h{a)-{Kb)t{a) + {K)r{a)-{K)l(.a)i otherwise. 

Proof. Let x\ , X2 and X3 be the projections of Xb on the linear subspaces A, 
B and C, respectively. The vector x\ can be expressed as a linear combination 
of the vectors Sv (» £ V), which means that there is a vector y £ MX so that 
xi = My. Similarly, we can write X2 = Nz. Together with x = £3, these 
vectors satisfy the following system of linear equations: 

(x + My + Nz = Xb 

I M T x = (4) 
I N T x = 

Multiplying the first m equations by the matrix M T , and using the second 
equation and the fact that M T N = 0, we get 

M T My = M T Xb , (5) 

and similarly, 

N T Nz = N T Xb - (6) 

Here M T M is the Laplacian of G and N T N is the Laplacian of G*, and so 
(||) implies that y = 7Tb + cl for some scalar c. Similarly, z = tt^ + c*l for 
some scalar c'. Thus 

x = Xb ~ M T (7T b + cl) - iV T « + c*l) = Xb ~ M T 7r b - N T ir* b , 

which is just the formula in the lemma, written in matrix form. □ 



2.2 Nondegeneracy properties of smooth circulations 

We state and prove two key properties of smooth circulations: one, that the 
projection of a basis vector to the space of smooth circulations is non-zero, 
and two, that smooth circulations are spread out essentially over the whole 
graph in the sense that every connected piece of the graph where a non-zero 
smooth circulation vanishes can be isolated from the rest by a small number 
of points. 
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Theorem 6 If g > 0, then for every edge e, rj e ^ 0. 

Proof. Suppose that r\ e = 0. Then by Lemma |j| there are vectors 
vr = vr(e) G M. v and vr* = vr*(e) G R F such that 

^h(a) ~ T*(o) = <(a) - "?(«) (7) 

for every edge a ^ e, but 

Tfc(<0 - vr t(e) = 1 + 7T* (e) - vr; (a) . (8) 

For convenience, we orient every edge so that ft*r a \ > ^(a) - 

Let «i < «2 < • • • < oik be the different values of vr*, and let T{ = {F G 
F : vr^ = aj. Since for every face F other than r(e) and 1(e) the value of 
TTp is the average of its values on the neighbors, it follows that T\ = {1(e)} 
and T k = {F\r(e)}. 

Consider the union Si of all faces in T\ U . . . U Fi, 1 < i < k. We claim 
that its boundary is a single cycle containing the edge e. It is trivial that 
e C dSi. Suppose that the boundary of Si contained a cycle C that did 
not go through e, which is a directed cycle with Si being on its left hand 
side. But then > vr^ for every edge of C. By ((^), this means that 
7r h(a) > 7Tt(a) for every edge a of C, which is clearly impossible. 

Thus we know that dSi consists of a single cycle through e; in other 
words, it consists of e and a path Pj connecting £(e) and /i(e). As above, it 
follows that Pi is a directed path from h(e) to i(e), and vr„ strictly increases 
as we traverse the path. 

Consider the two paths Pj_i and Pj. These may have common nodes 
besides their endpoints, but from the fact that vr strictly increases along 
both of them, it follows that their common nodes are in the same order on 
both paths. This implies that the two paths have the following structure: 
there are common (directed) subpaths Qo, Q\, . . . , Q r (possibly consisting of 
just a single node), and cycles C%, C2, ■ ■ ■ , C r so that Cj is the union of two 
directed paths Rj and R 1 -, connecting the endpoint of Qi-i to the starting 
point of Qi, with Rj C P i _ 1 and R'j C Pj. 

The union of all faces F G J~i is a (not necessarily connected) surface S\ 
whose boundary is the union of the cycles Cj. 

Claim. S[ consists of r discs D\, . . . D r , where the boundary of Dj is the 
cycle Cj. 

Let G[ be the subgraph of G contained in S 1 ,- (including the nodes on the 
boundary, but not the edges). Let H be a connected component of C . Since 
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every edge of G' has it^u) = ^t(a) by (0), all nodes of H have the same ir 
value. Since n is strictly increasing along Pi-i as well as along Pj, it follows 
that H can be attached to at most one node on each of these paths. By 
the 3-connectivity of G, it follows that H is a single edge connecting some 
u € V(Pi-\) and v € V{Pi). From ir u = ir v it also follows that H connects 
a node on an Rj to a node on the corresponding Rj, and the endpoints of 
these edges are in the same order on both paths, which gives an ordering of 
the edges of G' . 

Now every face in T\ is a disc attached along a cycle in Pi-\ U Pi U G' , 
so that the edges in G' have two faces attached, while the edges of Pj-i U Pi 
have only one. The only way to this is to attach each face in Ti to 

— two consecutive edges of G' and to the to subpaths of G between their 
endpoints, or 

- to the first or last edge of G' connecting two points on a cycle Cj and 
to the corresponding arc of Cj. 

Hence the Claim follows immediately. 

Now we see that Si is obtained from Si-i by attaching the disjoint disks 
Dx,...,D r along single arcs Ri, . . . , R r . Since S± is a disc, it follows by 
induction that Si is a disk, and in particular, Sk-i is a disc. Since S is 
obtained from Sk~i by gluing in the last face (a disc) along the cycle dS^-i, 
it follows that S is a sphere, and so g = 0. □ 

Theorem 7 Let G be a graph embedded in an orientable surface S of genus 
g so that all faces are discs. Let h be a non-zero smooth circulation on G 
and let G' be the subgraph of G on which h does not vanish. Suppose that h 
vanishes on all edges incident with a connected subgraph U of G. Then U 
can be separated from G' by at most 16g points. 

The assumption that the connectivity between U and the rest of the 
graph must be linear in g is "essentially" sharp in the following sense. Sup- 
pose that G has a cutset of 2g — 1 or fewer edges. Since the dimension of 
the space of smooth circulations is 2g, there will be a non-zero smooth cir- 
culation h vanishing on these edges. If this circulation is non-zero on (say) 
the left hand side of this cut, then we can replace all 

Before the proof of this theorem we need a simple lemma about maps. 

Lemma 8 Let G be any digraph embedded on an orientable surface S of 
genus g. Assume that G has no sources and sinks. For every face F, let qf 
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denote the number of nodes v on the boundary of F for which the two edges 
on the boundary of F incident with v are directed to v. Then 

^(a F - 1) < 2g - 2. 

F 

Proof. Let n, m and / denote the number of nodes, edges and faces. 
Clearly of is also the number the nodes on the boundary of F with both 
edges oriented out, and so Y2f ^ a F counts the number of "corners" with both 
edges oriented in or both edges oriented out. Since there are no sources or 
sinks, at every node there are at least 2 corners with one edge in and one 
out, and hence 

V 1a F < y^idy ~ 2) = 2m - 2n = 2/ + 4g - 4. 

F v 

Rearranging and dividing by 2, we get the inequality of the lemma. □ 

Corollary 9 // there is no face whose boundary is a directed cycle, then 
of < 2^r — 1 for every face. 

Now we are ready to prove Theorem 0. 
Proof. By re-orienting edges of G' we may assume that h > on the 
edges G' . Let us shrink every connected component of G\V(G') to a single 
point, to get a digraph G" embedded in S. Note that every face of G" is a 
disc and it contains at least one edge of G', and so it cannot be bounded by 
a directed cycle. 

Let w be the node to which U is contracted, let F' be the face of G' 
containing w. We think of F as a surface with a boundary dF consisting of 
k disjoint Jordan curves C\, . . . , glued to some (not necessarily disjoint) 
cycles of the graph G' . 

Let R be a face of G" contained in F' , bounded by a cycle dR. Then 
dF n dR consists of one or more arcs; let 6r denote the number of these 
arcs. 

Let rj denote the number of faces R with 6r = i, and let r[ be the 
number of faces among these which touch w. 

First we estimate r%. Along the common arc of dF and dR, the direction 
of the edges must change at least once: else, h would add up to a non-zero 
value along the face boundary dR, contradicting the assumption that h is 
smooth. Hence by Lemma |9| r% < 2qf' < 4g — 2. 



11 



Consider a face R with 6r > 1 touching w. Select a node vr in the 
interior of R, and connect it inside R by disjoint arcs to w and to one point 
on each of the arcs of dF n dR. Call these arcs red. 

Cut away all of the surface outside F', and contract each Cj to a single 
point. Also contract to a single point. This way we get a surface S* with 
genus g*. The red arcs form a graph G* embedded in S*. It is clear that 
G* is bipartite. 

Next, we estimate the number /| digonal faces of G*. Indeed, a digonal 
face must be formed by two red arcs originally connecting a node vr to two 
points a and a' on different common arcs A and A' of <9i? and a Cj, and by 
a disc R' bounded by these two red arcs and an arc B of Cj connecting a 
and a'. Since A and vl' are distinct arcs, there must be a face of G inside 
R' attached to at least one edge of Cj. In fact, there must be such a face 
Rq that is attached to G along a single arc. Clearly, Rq is disjoint from W 
and so it is counted in n — It is also clear that different digonal faces 
of G* correspond to different faces counted in n — r[. This proves that 

f 2 'i- 

Now use Euler's formula. The number of nodes of G* is 1 + k + where 
the 1 accounts for w, k is the number of components Cj, and k" = Xli>2 r i 
is the number of faces of G meeting W. The number of edges is m* = 
^ i>2 (i + l)r^. So the number of faces is 

f* = Y,(i + l)r' i - [\ + k + Y j r'i\+2g*-2. 

i>2 \ i>2 J 

There are f£ digons and /* — /| faces that are all at least 4-gons, and hence 
2m* > 2/ 2 * + 4(/ * -P 2 ) = 4/* - 2/ 2 * > 4/* - 2(n - ri) > 4/* - 4 5 - 2 + 2r' l5 
and so 

5^(i + l)r< = m* > 2/*-4 5 -2 + ri = r' 1 + ^2ir' i -2k-2-Ag* + A-Ag + 2 

i>2 i>2 

whence 

r[ + - l)r- < 2k + 4 5 * + 4g - 4. 

i>2 

Let denote the number of neighbors of u> on dF. Each neighbor of w 
is incident with at least two faces of G" containing w and an arc of dF, and 
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a face containing i arcs of dF is counted at most 2i times this way; hence 

2N < 2ir- < 4 I r' x + - l)r- ] < 8/c + I65 + I65* - 16. 

To conclude the proof, it suffices to show that 

2g * +k < 2g (9) 

unless g = g* = and k = 1. To this end, notice that no cycle Cj bounds a 
disk not containing u>. Indeed, the restriction of h to the connected compo- 
nent of G' contained in this disk would be a smooth circulation on a planar 
map, contradicting Lemma || 

If k = 1 and C\ bounds a disc containing w, then g* = and (S) is 
trivial. In every other case, none of the cycles C{ is null- homologous. 

Let us cut the surface 5 along each Cj one by one, and in each case, glue 
discs on both copies of Cj obtained by cutting. If this operation separates 
the surface, then we keep the part containing w only. This way we obtain 
the surface S*. 

Every cut reduces the genus of the surface, except when it separates the 
surface and the part thrown away is a sphere. Since each such part must 
have been attached along at least two cycles Cj, at least half of the cuts 
reduce the genus, showing that g* < g — k/2. This proves (^) and completes 
the proof of Theorem ^. □ 



3 Proof of Theorem g 

3.1 A lemma about convergence 

We prove a simple lemma about the convergence of a simple randomized 
iterative projection process. 

Let ai, . .. ,df. € 1", and let A = (l/k) Y2i=i a i a J ■ Let A be the small- 
est positive eigenvalue of A. Let L be the linear subspace generated by 
01, . . . , afc, and L- 1 , its orthogonal complement. 

For x G M. n , define a Markov chain X°,X 1 ,... G W 1 as follows: start 
with X° = x. Given X', choose a vector ai (uniformly and randomly), and 
let 

X t+l =X t_%^ a . 

a? ai 
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Let y and z be the orthogonal projections of x onto L and L, respec- 
tively. Then X 1 — > y, and in fact the following lemma describes the rate of 
convergence: 

Lemma 10 For every x GW 1 , 

E(|X'-y| 2 )<(l-A)'M 2 . 
Proof. First we consider the case t = 1: 



and hence 




Since z G L is in the range of A, we have z T Az > X\z\ 2 , which proves the 
assertion. 

The general case follows by induction. □ 



3.2 Setup for the proof 

Let a,i G R E be defined for every node i by 

1, if i is the head e, 
(aj) e = ^ —1, if i is the tail of e, 
0, otherwise. 



Furthermore, let bp G K E be defined for every face F by 

(Me 

We consider the matrices 



' 1, if e is an edge of dF oriented clockwise, 

— 1, if e is an edge of dF oriented counterclockwise, 
k 0, otherwise. 



r) ^ — J \n ■ 



19 ) 
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and 

Let Ai and A2 be the smallest positive eigenvalue of A and B, respectively, 
and let \i = min{Ai, A2}. 

Let Tt : M> E — > M> E denote the (random) linear mapping that (a) and 
(b) generate in step t. Note that subspace C is invariant under T t . Let 
W(t,s) = T s T s _ 1 ...T t+1 , 

, > _ J ej, if in step t edge j was excited, 
[0, otherwise. 

and u(t,s) = W(t, s)u(t). Hence 

s 

K 8 ) «(*,«). 



t=0 



Let ui(t, s), U2(t,s) and 143(4, s) denote the orthogonal projections of 
u(t,s) to A, B and C, respectively, and let Ui(t) = U{(t,t). This notation 
is somewhat redundant, since C is invariant under Tt, and hence us{t,s) = 
us(t) for every s > t. The "error part" is w(t,s) = u\(t,s) +U2(t,s). Thus 
we get the "smooth part" 

s 
4=0 

and the "error part" 

s 

x /7 ( S )= J>(t jS ). (10) 
4=0 

We need to bound x' from below and x" from above. 



3.3 Bounding smooth circulations 

The vector 143(4) is a smooth circulation. Suppose that it is not the circu- 
lation. Then by Theorem ^, we know that its restriction v(t) to Eq is not 
the circulation. We need a lower bound on \v(t)\. Recall that / = 
then we have 

Lemma 11 

W)\>n- n r f . 
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Proof, a; is a rational vector whose denominator is a divisor of det(M T M+ 
J) det(iV T N + J). By Hadamard's inequality, the denominator of x is at 
most n n f* . 

Now v(t) is a restriction of x which is nonzero, and so at least one 
coordinate of v(t) is a non-zero rational number with denominator at most 
n n ff. This proves (0). □ 



3.4 Bounding the error 

We prove a bound on the "error term". More exactly, we fix an integer 
a > 0, and split the error into "old errors" and "new errors" : 

s— a s 

x"(s) =J^u>(t,s)+ Yl l w(t,s)=X 1 (s,a)+X 2 (s,a). 



t=o 



t=s— a+1 



First we estimate the expectation of |Xi(s)| 2 . We claim that 



EHX^s, a)H< 5^(1 -aO" 



Let us fix the "excitations" u(0), u(l), . . ., and let E u denote expectation 
conditional on these. Lemma [lC] implies that 



and 
and so 



E u (ki(i,s)| 2 )<(l-A 1 ) s -'|ni(i)| 2 , 
E u (\u 2 (t,s)\ 2 )<(l-X 2 y- t \u 2 (t)\ 2 , 

E u {\w(t,s)\ 2 ) = E u (\ Ul (t,s)\ 2 + \u 2 (t,s)\ 2 ) 

< (l_^(K(t)| 2 + |n 2 (t)| 2 ) 

< (l-^-*|n(t)| 2 . 



(12) 
(13) 



From the definition of Si we have 

E U (|S 1 ( S )| 2 ) = E U 



1 s—a s—a 



i=0 



s—a s—a 



v t=0 t'=0 



t=0 t'=0 
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<EEE u (k(t, S )| 2 )^ Eu ( K t', s )p)V2 = (^(Ht^A . 

4=0 t'=0 \t=0 / 

Using (^), this gives 

E»(|EiOO| 2 ) < (EU - ^ s - t)/2 \u(t)\j • (14) 

Now we take expectation over the sequence u(t). Since the \u(t)\ are inde- 
pendent 0-1 valued variables with mean p, the expectation of the right hand 
side is easy to estimate: 



Bi-/^- t)/2 K*)l 

\t=0 ) 

E " M) (S "* )/2 (1 - M) (s -'' )/2 E(|u(t)||n(t')|). 



s— a s— a 



t=0 i'=0 

Here 

E(Kt)||«(f)|) = 
Thus we can write the sum above as 

s— a s—a 



p, if t = t', 
p 2 , otherwise. 



p 2 E B 1 - M) ( ~* )/a (i - /^ t,)/2 + (p - p 2 ) £u - d-*, 

t=o t'=o t=o 



l_(l_ M )l/2 J < * *> n r, ! _ (1 _ ^ 

<(1-M) B ^ + ^<5^(1-M) B . 
H l p. y. 

This proves (PI). 

Second, we consider X 2 . If no excitation event occurs between times 
s — a + 1 and s, then £2(5) = 0. Hence 

Pr(|S 2 (fl)| > 0) < 1 - (1 - p) a < ap. (15) 

From (0) and © we get that 

Pr(|x"(s)j > 0) < PrflXjOOl > 5) + Pr(|X 2 (s)| > 0) < -%(1 - /x) a + ap. 
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The choice a = ~ In y gives the best bound (up to a constant): 



Pv(\x"(s)\ > 5) < ^ln~. 

fl o 



(16) 



3.5 Completing the proof 

To simplify notation, put Z{ = z(ti), z\ = z'(ti) and z" = z"(ti). It follows 
from the choice of the integers ij that the vectors z\, z% , . . . Zk are linearly 
independent, and so dim(£(i)) = i. Furthermore, by the selection of these 
vectors, the Gram-Schmidt orthogonalization z* = z±, z%, ■ ■ ■ , zt consists of 
vectors of length at least e. 

Let 5 = (e/4)(l + l/e)~ 2ff °. Since \z"(t)\ < \x"(t)\, the probability that 
> 5 is less than ^ In | by (|16|), so the probability that any of the ti 
have |z"(£j)| > 5 is less than 



lOp , 1 1 

fjL 9 

So with probability at least .99, we have \z"\ < 5 for every i. Let us assume 
that this occurs. 

Claim 1. Suppose that for some real numbers a%, . . . ,atk, we have 



£ 

i=l 



OLnZi 



< 1. 



(17) 



Then 



1 / 1 

N < - i + - 

e \ e 



k—i 



Indeed, (FT?]) implies that for any 1 < j ' < k, 



ET * 
OLiZi Zj 



1=1 



< Z 

- \ z 3 I- 



Since z) z* = if i < j and z\ z*a = \z*A 2 , it follows that 



I II * 1 2 ■ I * t \ A i T ,* i ^ i * i i X 

\ajWzj | < | Zj | - ^ ^ \<\z j \ + 2^ 



ai z- 



i=3 



i=3 
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Dividing by \z*\ 2 and using that \zi\ < 1, we get that 



3 i=j i=j 

Hence the claim follows by induction on k — i. 

Claim 1. The vectors z[,. . . ,z f 2g are linearly independent. 

Indeed, assume that there is a linear relation Y2i a i z [ = where not all 
the cti are 0. We can write this as Yli a i z i = *l2i a i z i- Since the Zi are 
linearly independent, the left hand side is non-zero, and so we may assume 
it has norm 1. But then Claim 1 implies that \a.i\ < (l/e)(l + l/e) k ~ % , and 
so using that \z{\ < 5, 



< 



fX>l<«(i + ^) fc <i, 



a contradiction. 

Claim 3. d(z(t),C(k)) < \z"(t)\ + §. 

Indeed, by Claim 2 we can write z'(t) = Yli=i a i z i with some real num- 
bers «j. Then 

k k 

^ aiZi = z\t) + ^ Oiz". 
i=i i=i 

Let R be the norm of the vector on the two sides. Then by Claim 1, we 
have |Qj| < R(l/e)(l + l/e) h ~ l . Using this, we get 



R 



i=i 



< I - M (1 + 1) < 1 + Ir, 



whence R < 2, and so < (2/e)(l + l/e) fc ^. 
Let w = J2i=i UiZi € C(k). Then 



Iw-^t)! = 



1=1 



fc / 1\ fc 

<<$£K| <2<5(l + -J <e/2, 
i=i ^ ' 



and so 



d(z(t),C(k)) < \z(t)-w\ < \z{t) - z'{t)\-\z'{t) -w\ < \z"(t)\ + -, 



as claimed. 
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Now we are ready to complete the proof. In guessing that k/2 is the 
genus of the surface, we can err in two directions: it may be that g < k/2 
or g > k/2. We estimate the probability of these errors separately. 

If g < k/2, then we moved on when we had k = 2g, which means that 
at that stage we had too many vectors z(t) farther from C(k) than e. By 
Claim 3, every such t must satisfy > e/2 > 5. By (|T^), the probability 

that a given t has this property is less than In y, and so using Chernoff 's 
inequality, the probability that the number of indices t with this property is 
at least T' is less than 1/9. 

If g > k/2, then we stopped too early: there was a subspace V with 
dim(L') < dim(L) so that almost all the z(t) were closer to L' than n _10n . 
The mapping M s — > £ obtained by projecting W E to C and then restricting 
it to Eq is surjective; this implies that there are 2g edges e%, . . . , ei a € E 
so that the vectors n(e%), ■ ■ ■ ,^{fi2g) form a basis in C. These vectors are 
rational with denominators at most n n ff, so the determinant of this basis 
is at least n~ m ° n f~ m °f > e. It follows that at least one of these vectors, say 
yu(ei), is at distance at least e from C. 

Thus if in any of the time intervals \{t — 1)N + l,tN], the edge e% was 
excited, no other edge was excited, and the excitation of ej occurred in the 
first N — a steps in this time interval, then z(t) must be farther from £' than 
e. The probability that this happens for a given t is 

so (using the Chernoff bound again) the probability that this happens for 
fewer than T' choices of t is less that 1/9. 

To sum up, the total probability of "bad" cases is less than 1/9 (when 
\zi\ > 5 for some i) plus 1/9 (when g < k/2) plus 1/9 (when g > k/2). This 
proves the theorem. 

4 Concluding remarks 

1. We can make some cosmetic changes to the setup as given above. One 
objection may be that the noisy circulator, as defined, is not truly local, 
since (say) in operation (a) we have to select a node uniformly from all 
nodes. The standard way of fixing this is to attach an "alarm clock" to each 
node, edge, and face, which wakes them up at random times according to a 
Poisson process (the edge-clock is much slower than the other two). 

Another objection is that the noisy circulator as constructed above is not 
stationary: the total mass grows to infinity. An easy fix is to give a second, 
even slower clock to each edge: when this rings, they reset their value to 0. 
Another possible fix comes from the observation that the excitations don't 
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necessarily have to be constants, the proof works just as well for random 
and symmetric excitations. So modifying the excitation step to reset the 
value of any edge to 1 with very small probability (rather than to add 1) 
provides a stationary version (but the analysis becomes more complicated). 

Further variants, improvements and generalizations of the above system 
are of interest: 

• Can one recover the genus even if the rate at which excitations take 
place is faster (ideally, independent of the number of nodes)? 

• Suppose that we only allow two values (or any other given discrete set 
of values) on any edge. Can we still recover the genus? 

• Can one extend the technique to recover a non-orientable surface by 
observing a random process on an embedded graph locally? 

For background regarding graphs on surfaces and Riemann surfaces see 
for instance [11] pj], for background regarding algorithmic applications of 



random processes see [JjJ . 

2. The notion of global information from local observation can inspire many 
questions in different directions. We briefly present some related examples. 

Example 1 Let G be a finite connected graph. Start a simple random walk 
on G. Fix a vertex v in G. You are given the sequence of times for which 
the simple random walk visits v, what information can be learned about G? 
From the infinite sequence one can reconstruct the on diagonal heat kernel 
and thus the spectrum of the transition matrix of the random walk (so in 
the case of regular graphs, the spectrum of the graph). 



Example 2 [Obtaining the size of the road system by measuring the volume 
of local traffic]. Let G be a finite connected irreducible regular graph. From 
each vertex start an independent simple random walk. Fix a vertex v € G. 
For each time t you can observe the number of walkers occupying v. How 
much time is needed in order to, almost surely, know the size of Gl Assume 
you are given an a priori bound N on n = \V(G)\. Then one can get 
a (poor) polynomial upper bound on the time needed along the following 
lines. The mixing time of G is smaller than n 3 , so if we observe the load on 
v only at times of the form fciV 3 , we get almost independent samples from a 
distribution which is exponentially close to Binomial(n, 1/n), the stationary 
distribution for the number of walkers at v. For Binomial(n, 1/n), about n 4 
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samples are needed to recover n. So after about N 7 steps n can be recovered. 
(Probably iV 4 is the time needed to recover n.) 

There are several random processes on graphs which have been consid- 
ered before, and for which the question "what global information can be 
deduced from local observation" is meaningful. We mention two examples. 

Example 3 Consider an n- vertex connected graph. There are k <n parti- 
cles labeled 1, 2, k. In a configuration, there is one particle at each vertex. 
The interchange process discussed briefly in [|J , is a continuous-time Markov 
chain on configurations. For each edge (i, j), at rate 1 the particles at vertex 
i and vertex j are interchanged. If only one of the two vertices is occupied, 
then it jumps to the other vertex. Assume you observe which labeled par- 
ticle occupies a fixed predetermined vertex at any time. For k = 1, this is 
just Example ||. If k > 1, can one recover, using these observations, further 
information about G not contained in the spectrum? If k = n, is it possible 
to reconstruct Gl 

Example 4 Another natural candidate for local observation is the heat- 
bath chain (Glauber dynamics) on fc-colorings of a graph: at any step, we 
have a (legal) A;-coloring; we select a random node v and a random color a, 
and we re-color v with color a if this gives a legal /c-coloring. Can we derive 
estimates on the chromatic number, or maximum degree, by observing a 
bounded piece of the graph? 
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